Visual Estimation of Attentive Cues in HRI: The Case of Torso and Head Pose
Abstract
Visual human-centered information is a fundamental input source for effective and successful human-robot interaction (HRI) in dynamic multi-party social settings. Torso and head pose, as forms of nonverbal communication, support the derivation of people’s focus of attention, a key variable in the analysis of human behaviour in HRI paradigms encompassing social aspects. Towards this goal, we have developed a model-based approach for torso and head pose estimation to overcome key limitations in free-form interaction scenarios and issues of partial intra- and inter-person occlusions. The proposed approach builds upon the concept of Top View Re-projection (TVR) to uniformly treat the respective body parts, modelled as cylinders. For each body part, a number of pose hypotheses are sampled from its configuration space. Each pose hypothesis is evaluated against a scoring function, and the hypothesis with the best score yields the assumed pose and the location of the joints. A refinement step on head pose is applied, based on tracking facial patch deformations, to compute the horizontal off-plane rotation. The overall approach forms one of the core components of a vision system integrated in a robotic platform that supports socially appropriate, multi-party, multimodal interaction in a bartending scenario. Results in the robot’s environment during real HRI experiments with varying numbers of users attest to the effectiveness of our approach.
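The sample-and-score step mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' TVR scoring function, which the abstract does not specify. The names `best_hypothesis` and `angular_distance`, the one-dimensional yaw configuration space, and the toy score (negative angular distance to a hypothetical measured yaw) are all assumptions made for illustration.

```python
import math
import random


def angular_distance(a, b):
    """Smallest absolute difference between two angles, in radians."""
    d = (a - b) % (2 * math.pi)
    return min(d, 2 * math.pi - d)


def best_hypothesis(sampler, score, n_hypotheses):
    """Draw n_hypotheses candidates and return the best (hypothesis, score) pair."""
    best, best_score = None, -math.inf
    for _ in range(n_hypotheses):
        hypothesis = sampler()
        s = score(hypothesis)
        if s > best_score:
            best, best_score = hypothesis, s
    return best, best_score


# Toy setup (hypothetical): the configuration space is a single yaw angle,
# and the score rewards hypotheses close to an assumed measured yaw.
rng = random.Random(0)
observed_yaw = 0.6  # assumed measurement, radians


def sample_yaw():
    return rng.uniform(-math.pi, math.pi)


def toy_score(yaw):
    return -angular_distance(yaw, observed_yaw)


estimated_yaw, estimated_score = best_hypothesis(sample_yaw, toy_score, 500)
```

In a real system the score would compare the re-projected cylinder model with image or depth evidence. Here the score is a stand-in, so only the selection logic is shown.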
Similar Resources
Appearance-based Person Tracking and 3d Pose Estimation of Upper-body and Head
In the field of human-robot interaction (HRI), recognition of humans in a robot’s surroundings is a crucial task. Besides the localization, the estimation of a person’s 3D pose based on monocular camera images is a challenging problem on a mobile platform. For this purpose, an appearance-based approach, using a 3D model of the human upper body, has been developed and experimentally investigated....
Continuous Head Pose Estimation using Random Regression Forests
Head pose is a rich visual cue that finds great interest in the field of human-robot interaction (HRI) and for video surveillance applications. Previous attempts at solving this problem have often proposed solutions formulated in a classification setting. Furthermore, strong assumptions on illumination and scale in an occlusion-free environment have usually been made. We propose a regression so...
Camera Pose Estimation in Unknown Environments using a Sequence of Wide-Baseline Monocular Images
In this paper, a feature-based technique for the camera pose estimation in a sequence of wide-baseline images has been proposed. Camera pose estimation is an important issue in many computer vision and robotics applications, such as, augmented reality and visual SLAM. The proposed method can track captured images taken by hand-held camera in room-sized workspaces with maximum scene depth of 3-4...
Are juvenile domestic pigs (Sus scrofa domestica) sensitive to the attentive states of humans?--The impact of impulsivity on choice behaviour.
Previous studies have shown that apes, dogs and horses seem to be able to attribute attentive states to humans. Subjects had to choose between two persons: one who was able to see the animal and one who was not. Using a similar paradigm, we tested a species that does not rely strongly on visual cues, the domestic pig (Sus scrofa domestica). Subjects could choose between two unfamiliar persons, ...
Robust Attentive Behavior Detection by Non-linear Head Pose Embedding and Estimation
We present a new scheme to robustly detect a type of human attentive behavior, which we call frequent change in focus of attention (FCFA), from video sequences. FCFA behavior can be easily perceived by people as temporal changes of human head pose (normally the pan angle). For recognition of this behavior by computer, we propose an algorithm to estimate the head pan angle in each frame of the s...